Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1132720070050020068
Genomics & Informatics
2007 Volume.5 No. 2 p.68 ~ p.76
A Statistical Analysis of SNPs, In-Dels, and Their Flanking Sequences in Human Genomic Regions
Shin Seung-Wook

Kim Young-Joo
Kim Byung-Dong
Abstract
Due to the increasing interest in SNPs and mutational hot spots for disease traits, it is becoming more important to
define and understand the relationship between SNPs and their flanking sequences. To study the effects of flanking
sequences on SNPs, statistical approaches are necessary to assess bias in SNP data. In this study we mainly applied
Markov chains for SNP sequences, particularly those located in intronic regions, and for analysis of in-del data. All of the pertaining sequences showed a significant tendency to generate particular SNP types. Most sequences flanking SNPs had lower complexities than average sequences, and some of them were associated with microsatellites. Moreover, many Alu repeats were found in the flanking sequences. We observed an elevated frequency of single-base-pair repeat-like sequences, mirror repeats, and palindromes in the SNP flanking sequence data. Alu repeats are hypothesized to be
associated with C-to-T transition mutations or A-to-I RNA editing. In particular, the in-del data revealed an association
between particular changes such as palindromes or mirror repeats. Results indicate that the mechanism of induction
of in-del transitions is probably very different from that which is responsible for other SNPs. From a statistical perspective, frequent DNA lesions in some regions probably have effects on the occurrence of SNPs.
KEYWORD
single nucleotide polymorphisms, SNPs, Intron, Markov chain
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI) KoreaMed